Extracting Predominant Local Pulse Information From Music Recordings
Identifieur interne : 000381 ( Main/Exploration ); précédent : 000380; suivant : 000382Extracting Predominant Local Pulse Information From Music Recordings
Auteurs : Peter Grosche [Allemagne] ; Meinard Miiller [Allemagne]Source :
- IEEE transactions on audio, speech, and language processing [ 1558-7916 ] ; 2011.
Descripteurs français
- Pascal (Inist)
English descriptors
- KwdEn :
Abstract
The extraction of tempo and beat information from music recordings constitutes a challenging task in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. Then we employ an overlap-add technique accumulating all these kernels over time to obtain a single function that reveals the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres.
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000007
- to stream PascalFrancis, to step Curation: 000007
- to stream PascalFrancis, to step Checkpoint: 000006
- to stream Main, to step Merge: 000381
- to stream Main, to step Curation: 000381
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Extracting Predominant Local Pulse Information From Music Recordings</title>
<author><name sortKey="Grosche, Peter" sort="Grosche, Peter" uniqKey="Grosche P" first="Peter" last="Grosche">Peter Grosche</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Saarland University and the Max-Planck Institut fur Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName><region type="land" nuts="2">Sarre (Land)</region>
<settlement type="city">Sarrebruck</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Miiller, Meinard" sort="Miiller, Meinard" uniqKey="Miiller M" first="Meinard" last="Miiller">Meinard Miiller</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Saarland University and the Max-Planck Institut fur Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName><region type="land" nuts="2">Sarre (Land)</region>
<settlement type="city">Sarrebruck</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">11-0363501</idno>
<date when="2011">2011</date>
<idno type="stanalyst">PASCAL 11-0363501 INIST</idno>
<idno type="RBID">Pascal:11-0363501</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000007</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000007</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000006</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000006</idno>
<idno type="wicri:doubleKey">1558-7916:2011:Grosche P:extracting:predominant:local</idno>
<idno type="wicri:Area/Main/Merge">000381</idno>
<idno type="wicri:Area/Main/Curation">000381</idno>
<idno type="wicri:Area/Main/Exploration">000381</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Extracting Predominant Local Pulse Information From Music Recordings</title>
<author><name sortKey="Grosche, Peter" sort="Grosche, Peter" uniqKey="Grosche P" first="Peter" last="Grosche">Peter Grosche</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Saarland University and the Max-Planck Institut fur Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName><region type="land" nuts="2">Sarre (Land)</region>
<settlement type="city">Sarrebruck</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Miiller, Meinard" sort="Miiller, Meinard" uniqKey="Miiller M" first="Meinard" last="Miiller">Meinard Miiller</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Saarland University and the Max-Planck Institut fur Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName><region type="land" nuts="2">Sarre (Land)</region>
<settlement type="city">Sarrebruck</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">IEEE transactions on audio, speech, and language processing</title>
<title level="j" type="abbreviated">IEEE trans. audio speech lang. process.</title>
<idno type="ISSN">1558-7916</idno>
<imprint><date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">IEEE transactions on audio, speech, and language processing</title>
<title level="j" type="abbreviated">IEEE trans. audio speech lang. process.</title>
<idno type="ISSN">1558-7916</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Acoustic signal processing</term>
<term>Audio signal processing</term>
<term>Kernel method</term>
<term>Linear prediction</term>
<term>Musical sound</term>
<term>Noise immunity</term>
<term>Onset time</term>
<term>Rhythm</term>
<term>Signal processing</term>
<term>Target tracking</term>
<term>Time variation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Rythme</term>
<term>Temps établissement</term>
<term>Variation temporelle</term>
<term>Méthode noyau</term>
<term>Prédiction linéaire</term>
<term>Immunité bruit</term>
<term>Poursuite cible</term>
<term>Son musical</term>
<term>Traitement signal audio</term>
<term>Traitement signal</term>
<term>Traitement signal acoustique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">The extraction of tempo and beat information from music recordings constitutes a challenging task in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. Then we employ an overlap-add technique accumulating all these kernels over time to obtain a single function that reveals the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres.</div>
</front>
</TEI>
<affiliations><list><country><li>Allemagne</li>
</country>
<region><li>Sarre (Land)</li>
</region>
<settlement><li>Sarrebruck</li>
</settlement>
</list>
<tree><country name="Allemagne"><region name="Sarre (Land)"><name sortKey="Grosche, Peter" sort="Grosche, Peter" uniqKey="Grosche P" first="Peter" last="Grosche">Peter Grosche</name>
</region>
<name sortKey="Miiller, Meinard" sort="Miiller, Meinard" uniqKey="Miiller M" first="Meinard" last="Miiller">Meinard Miiller</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Sarre/explor/MusicSarreV3/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000381 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000381 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Sarre |area= MusicSarreV3 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:11-0363501 |texte= Extracting Predominant Local Pulse Information From Music Recordings }}
This area was generated with Dilib version V0.6.33. |